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Abstract: The idea that the existence of a consistent UV completion satisfying the fun- 
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damental axioms of local quantum field theory or string theory may impose positivity 
constraints on the couplings of the leading irrelevant operators in a low-energy effective 
field theory is critically discussed. Violation of these constraints implies superluminal prop- 
agation, in the sense that the low- frequency limit of the phase velocity u p h(0) exceeds c. It 
is explained why causality is related not to i> p h(0) but to the high-frequency limit u p h(oo) 
and how these are related by the Kramers-Kronig dispersion relation, depending on the 
Q_i! sign of the imaginary part of the refractive index Im n(u) which is normally assumed pos- 

itive. Superluminal propagation and its relation to UV completion is investigated in detail 
in three theories: QED in a background electromagnetic field, where the full dispersion 
relation for n(uj) is evaluated numerically for the first time and the role of the null energy 
condition T fll/ k fJ, k u > is highlighted; QED in a background gravitational field, where ex- 
amples of superluminal low-frequency phase velocities arise in violation of the positivity 
constraints; and light propagation in coupled laser-atom A-systems exhibiting Raman gain 
lines with Im n(u) < 0. The possibility that a negative Im n(uS) must occur in quantum 
field theories involving gravity to avoid causality violation, and the implications for the 
relation of IR effective field theories to their UV completion, are carefully analysed. 
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1. Introduction 

Effective field theories have always played an important role in particle physics as low- 
energy phenomenological models describing the IR dynamics of renormalisable quantum 
field theories [1]. Key examples include the chiral Lagrangians describing the light pseu- 
doscalar mesons arising from chiral symmetry breaking in QCD or those related to elec- 
troweak symmetry breaking in the absence of a light Higgs boson. More recently, the 
necessity of renormalisability as a criterion in constructing unified field theories has been 
reconsidered and it has become familiar to think of the standard model and its supersym- 
metric generalisations as IR effective theories of some more fundamental QFT or string 
theory at the Planck scale. 

The question then arises whether all effective field theories admit a consistent UV 
completion, i.e. whether they can arise as the IR limit of a well-defined renormalisable 
QFT or string theory. If not, we may be able to use the criterion of the existence of a UV 
completion as a constraint on the parameters of the IR effective field theory, perhaps with 
interesting phenomenological implications. 

At first sight, the answer certainly appears to be no. It is well-known that the pa- 
rameters of chiral Lagrangians derived from known renormalisable theories such as QCD 
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satisfy constraints arising from analyticity and unitarity. However, in general the situation 
may not be so straightforward, particularly when we consider theories involving gravity. 
The purpose of this paper is to expose some of the subtleties that arise in establishing 
the relation between constraints on IR effective field theories and their UV completion, 
especially those that arise from considerations of causality and the absence of superluminal 
propagation. 

The idea that the existence of a well-defined UV completion may place important 
constraints on the couplings of the leading irrelevant operators in low-energy effective 
actions has been revisited recently in ref.[2]. This paper highlights a number of examples 
where these couplings must satisfy certain positivity constraints to ensure that the effective 
theory does not exhibit unphysical behaviour such as superluminal propagation of massless 
particles or violations of analyticity or unitarity bounds on scattering amplitudes. This 
has been developed in ref.[3], where positivity constraints are derived on the couplings of 
the operators controlling WW scattering in an effective theory of electroweak symmetry 
breaking. Turning the argument around, these papers claim that an observed violation of 
these positivity constraints would signify a breakdown of some of the fundamental axioms 
of QFT and string theory, such as Lorentz invariance, unitarity, analyticity or causality. 

In this paper, we examine these arguments critically and show that the assumed rela- 
tion between the IR effective field theory and its UV completion may be significantly more 
subtle. We focus on the issue of superluminal propagation in the effective theory. This 
makes contact with our extensive body of work (see e.g. [4, 5, 6, 7, 8, 9, 10]) on superlumi- 
nality, causality and dispersion in QED in curved spacetime, a subject we have given the 
name of 'quantum gravitational optics' [9]. As will become clear, our discussion is equally 
applicable to the issue of analyticity bounds, since each ultimately rests on the behaviour 
of the imaginary (absorptive) part of the refractive index or forward scattering amplitude. 

We begin by discussing the general form of the effective action for low-energy elec- 
trodynamics and derive positivity constraints on the couplings of the leading irrelevant 
operators based on the requirement that light propagation is not superluminal. However, 
the 'speed of light' obtained from the IR effective theory is simply the zero-frequency limit 
of the phase velocity, v p h(0). In section 3, we present a careful analysis of dispersion and 
light propagation and show that in fact the 'speed of light' relevant for causality is t> p h(oo), 
i.e. the high-frequency limit of the phase velocity. Determining this requires a knowledge 
of the UV completion of the quantum field theory. 

It therefore appears that causality and superluminal propagation place no restriction 
on the low-energy theory. However, on the basis of the usual axioms of quantum field 
theory (local Lorentz invariance, analyticity, microcausality) we can prove the Kramers- 
Kronig dispersion relation for the refractive index: 



It is usually assumed that Im n(uj) > 0, typical of an absorptive medium. (Similar rela- 
tions hold for scattering amplitudes. Here, the imaginary part of the forward scattering 
amplitude, Im Ai(s, 0), is related via the optical theorem to the total cross section, ensur- 
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ing positivity.) If this is true, then eq.(l.l) implies that v p h(oo) > v p h(0). It follows that 
if Vph(O) is found to be superluminal in the low-energy theory then v p h(oo) is necessarily 
also superluminal, in contradiction with causality Violation of the positivity constraints 
on the couplings in the IR effective action would indeed then mean that the theory had 
no consistent UV completion. The central point of this paper, however, is to question the 
assumption that Im n(u) is necessarily positive, especially in theories involving gravity. 

First, we consider an example that is under complete control and where the conven- 
tional expectations are realised, viz. QED in a background electromagnetic field. The 
background field modifies the propagation of light due to vacuum polarisation and induces 
irrelevant operators in the low-energy effective action. Recent technical developments mean 
that we now have an expression for the vacuum polarisation valid for all momenta for gen- 
eral constant background fields [11] and in section 4 we use this to derive the full frequency 
dependence of the refractive index n(cj). This is the first complete numerical evaluation of 
the QED dispersion relation. This shows the expected features - a single absorption line 
with Im n(uj) > and the corresponding phase velocity satisfying u p h(oo) > v p h(0). The 
low-energy couplings, which determine the Euler-Heisenberg effective action, are consistent 
with the positivity constraints following from imposing i> p h(0) < l. 1 

An important feature of our analysis is the demonstration of the precise role of the 
null energy condition T^ v k^k v > 0, where is the photon momentum. (See also, e.g., 
refs. [5, 12, 13].) The null projection of the energy-momentum tensor occurs universally 
in the dispersion relation for light propagation in background fields - the speed of light is 
determined by both the couplings and the sign of T^ v k^k v . This is greatly clarified by our 
use of the Newman- Penrose, or null-tetrad, formalism, which is by far the clearest language 
in which to analyse the propagation of light in general backgrounds. 

We then consider the analogous case of QED in a background gravitational field, i.e. in 
curved spacetime. Here, current techniques only allow a determination of the refractive 
index and phase velocity in the low-frequency region. Moreover, the null-energy projection 
T^ v k^k v (the Newman- Penrose scalar <3?oo) is not the only quantity determining the speed 
of light. There is also a polarisation-dependent contribution given by the Weyl tensor pro- 
jection C^\ vp k x k p a^a u (essentially the NP scalar \l/o), which gives rise to the phenomenon 
of gravitational birefringence. The remarkable feature of QED in curved spacetime is that, 
even for Weyl-flat spacetimes, the positivity constraints on the low-energy couplings which 
would prohibit superluminal propagation are violated [4]. We can readily find examples 
exhibiting a superluminal low-frequency phase velocity, f p h(0) > 1. Furthermore, for Ricci- 
flat spacetimes it is possible to prove a simple sum rule [5] which shows that one polarisation 
is always superluminal, whatever the sign of the couplings. We illustrate these phenomena 
with examples of propagation in FRW and black- hole spacetimes in section 5. 

This raises the central issue of this paper. If the KK dispersion relation holds in grav- 
itational backgrounds and Im n(uj) > 0, then necessarily u p h(oo) > f p h(0) > 1 for certain 
spacetimes, in apparent violation of causality. (See ref.[8] and sections 3 and 5 for a more 
nuanced discussion). The most natural escape seems to be that for theories involving grav- 

1 Note that in general we set c = 1 throughout this paper. 
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ity we must have the possibility that Im n{u) < 0. But if so, this invalidates the conclusion 
that IR effective field theories with couplings violating the positivity constraints necessarily 
have no consistent UV completion. Once we admit the possibility that a fundamental QFT 
satisfying the standard axioms (and surely QED in curved spacetime is such a theory) can 
have a negative absorptive part, Im n(oj) < or Im M(s, 0) < 0, then we must accept that 
the relation between the IR effective theory and its UV completion can be rather more 
subtle. 

In order to understand better what a negative Im n(uj) would mean physically, in 
section 6 we introduce a fascinating topic in atomic physics - the interaction of lasers with 
so-called 'A-systems' exhibiting electromagnetically-induced transparency (EIT) and 'slow 
light'. We discuss a variant in which the A-system is engineered to produce Raman gain 
lines [14] rather than absorption lines in Im n{oo) and derive in detail an explicit example 
where f p h(oo) < f p h(0). This occurs as a result of the intensity gain of a light pulse 
propagating through the coupled laser-atom medium. Our conjecture is that gravity may 
be able to act in the same way. 

The issue of whether the existence of a consistent UV completion does indeed impose 
positivity constraints on the couplings of an IR effective field theory therefore remains open. 
In particular, theories involving gravity present a serious challenge to the accepted wisdom 
on the relation of the IR and UV behaviour of QFT. A summary of our final conclusions 
is presented in section 7. 



- 4 - 



2. Superluminality constraints in low-energy quantum electrodynamics 

The low-energy effective action for QED, valid for momenta below the scale of the electron 
mass m, is given by the well-known Euler-Heisenberg Lagrangian, 

£= -\w + ^{-h {F ^ )2 + Io F ^ FxpF " XFUP ) (2 - 1} 

This is more conveniently written in terms of the two Lorentz scalars quadratic in the field 
strengths, T = {F^F^ and Q = {F^F^, as follows: 

where c\ = 4 and C2 = 7. 

The higher-order terms omitted in eqs.(2.1) or (2.2) are of two types: higher orders in 
derivatives, i.e. terms of 0(D 2 F 4 /m 6 ) which give contributions to the photon dispersion 
relation suppressed by 0(k 2 /m 2 ) 2 , and higher orders in the field strengths J 7 , Q which are 
associated with higher powers of a. 

In this section, we consider photon propagation in a background electromagnetic field 
using this effective Lagrangian and discuss the constraints imposed on the coefficients 
c±,C2 of the leading (dim 8) irrelevant operators in the low-energy expansion (2.2) by the 
requirement that propagation is causal, i.e. subluminal. 

The simplest way to study the causal nature of photon propagation is to use geometric 
optics (see, e.g. refs.[8, 9] for reviews of our formalism). Since we shall be considering 
QED in a background gravitational field later in this paper, the following brief summary 
of geometric optics is sufficiently general to include the case of propagation in an arbitrary 
spacetime metric g^ u . The electromagnetic field in the (modified) Maxwell equations de- 
rived from eq.(2.2) is written in terms of a slowly- varying amplitude (and polarisation) and 
a rapidly- varying phase as follows: 

(^ + ieB^ + . . .)e^ (2.3) 

where e is a parameter introduced to keep track of the relative orders of terms as the 
Maxwell and Bianchi equations are solved order- by-order in e. The wave vector is identified 
as the gradient of the phase, = d^-d. We also write = Aa^, where A represents the 
amplitude itself while a M specifies the polarisation. 

Solving the usual Maxwell equation D jl F ilv = 0, we find at 0(1/ e), 

k 2 = (2.4) 

while at O(l), 

k^D^a" = (2.5) 

and 

k»D lx (lnA) = --D^ (2.6) 

2 Here, we are using 'fc 2 ' to denote two powers of momentum, not necessarily the contracted form 
which can of course be zero on-shell even for large values of \k\. Similarly for 'D 2 . 
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Eq.(2.4) shows immediately that fc M is a null vector. From its definition as a gradient, we 
also see 

WD^X = k»D u k^ = -D u k 2 = (2.7) 

Light rays, or equivalently photon trajectories, are the integral curves of i.e. the curves 
x fl (s) where dx^/ds = k^ . These curves therefore satisfy 

_ ,11 - , d x ,, dx^ dx , _ 

= ^ = ^+1^- — (2.8) 

This is the geodesic equation. We conclude that for the usual Maxwell theory in general 
relativity, light follows null geodesies. Eqs.(2.7),(2.5) show that both the wave vector and 
the polarisation are parallel transported along these null geodesic rays, while eq.(2.6), 
whose r.h.s. is just (minus) the optical scalar 6, shows how the amplitude changes as the 
beam of rays focuses or diverges. 

As a consequence of only keeping terms in the effective action with no explicit deriva- 
tives acting on the field strengths, the modified light-cone condition derived from the low- 
energy effective action (2.2) remains homogeneous and quadratic in k^ . It can therefore 
be written in the form: 

Q^k^K = (2.9) 

where Q^ v is a function of the background electromagnetic (or gravitational) field. In the 
gravitational case, Q^ u also involves the photon polarisation. 

Now notice that in the discussion of the free Maxwell theory, we did not need to 
distinguish between the photon momentum p* 1 , i.e. the tangent vector to the light rays, 
and the wave vector k^ since they were simply related by raising the index using the 
spacetime metric, pP = g^ u k v . In the modified theory, however, there is an important 
distinction. The wave vector, defined as the gradient of the phase, is a covariant vector or 
1-form, whereas the photon momentum/tangent vector to the rays is a true contravariant 
vector. The relation is non-trivial. In fact, given k^, we should define the corresponding 
'momentum' as 

p » = g^k u (2.10) 

and the light rays as curves x^(s) where ^ = p^. This definition of momentum satisfies 

G^pV = g^k„k v = (2.11) 

where G = Q" 1 defines a new effective metric which determines the light cones mapped 
out by the geometric optics light rays. (Indices are always raised or lowered using the 
true metric g^u-) The ray velocity t> ray corresponding to the momentum p^, which is the 
velocity with which the equal-phase surfaces advance, is given by (defining components in 
an orthonormal frame) 

ray ~ p°~ dt [ ] 

along the ray. This is in general different from the phase velocity 

- m <2 - 13) 



-6- 



where the frequency uj = k . 

This shows that photon propagation for low-energy QED in a background electromag- 
netic or gravitational field (i.e. curved spacetime) can be characterised as a bimetric theory 
- the physical light cones are determined by the effective metric G pv while the geometric 
null cones are fixed by the spacetime metric g pu . 

This also clarifies a potentially confusing point. In the perturbative situation we are 
considering here, where the effective Lagrangian L(T ,Q) = —J- + 0(a) and the effective 
metric G pu = g pv + 0(a), subluminal propagation with v p ^ < 1 corresponds to k 2 < 0, 
i.e. the wave-vector is spacelike. However, the analysis above makes it clear that in this 
case the photon momentum is nevertheless timelike, i.e. p 2 > if t> p h < 1 (and v ray < 1). 
For further discussion of this point and an illustration in the case of QED in a background 
FRW spacetime, see ref.[8]. 

Now consider a general low-energy effective action L(T,Q). The modified Maxwell 
equation is 

4F^F^ + 4;fyF^+(£^^ = 

(2.14) 

while the Bianchi identity remains 

d p F^ v = (2.15) 

We now immediately restrict to the case of interest here, viz. C(J-,Q) = —J- + 0(a). 
It would be trivial to extend the results of this section to this more general case, which 
includes the important example of Born-Infeld theory. 3 Applying geometric optics, we then 
find 

k 2 a v - k.ak u - C^k p F^ 'F Xp k x a p - Cggk^F^ 'F Xp k X a p 

-C T g(k p F^F Xp k x a p + k p F pu F Xp k x a p ) = (2.16) 

where the field strengths now refer to the background electromagnetic field. 

The natural language to discuss photon propagation in background fields is the Newman- 
Penrose (NP), or null tetrad, formalism (see appendix A). At each point in spacetime, 
we establish a tetrad of null vectors £ p ,n u ,m fl ,fh p- such that i.n = l,m.m = —1 with 
all others zero. We choose to lie along the (unperturbed) direction of propagation, 
i.e. k p = \/2uji^ + 0(a), where w is normalised to be the frequency. The polarisation 
vector is then 

= am p + /3m M (2.17) 

where m^(m^) corresponds to left (right) circular polarisation, i.e. photon helicity +1(— 1). 

Substituting for the polarisation a p in eq.(2.16) and contracting successively with m v 
and m u then yields the equations: 

k 2 a + 2io 2 ( K CrrF pv F Xp + CggF pv F Xp + Crg(F pv F Xp + F pv F Xp )y = 

(2-18) 

3 The essential simplification we make is that we can drop the factor Lt in front of d^F^ since the 
latter is already O(a) by virtue of the equations of motion, so we should approximate Lr — 1 when we 
work consistently to 0(a). 
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and 

k 2 (5 + 2u?(c TT F pv F Xp + LggF pu F Xp + L T g(F pu F Xp + = 

(2.19) 

To simplify further, we now insert the explicit form for the background fields written 
in NP form. Here, the six independent components of the field strength F^ v , i.e. the E 
and B fields, are represented by three complex scalars <Pq-,4>\-,4>2 such that (see appendix 
A) 

F» v = - (0i + 4>\Wn v ] + (0! - <j>\ ) [m^m u ] + 4> 2 [l^m u ] + [tm v \ - (f>* [n»m u ] - O [/m"] 

(2.20) 

where the notation is [ab] = ab — ba. Then, after some calculation, we find that eqs.(2.18), 
(2.19) can be written in matrix form as 



k 2 + 2uj 2 {C^ + C gg )^l 2J 2 {L TT - Cgg - 2iCrg)<j%<j>* \ /a 
2uo 2 (C rr - Cgg + 2iCrg)<t>o<fo k 2 + 2u 2 {L TT + Cgg)cp cp* ) \(3 , 



= 
(2.21) 



The coefficients have a particularly natural interpretation in terms of the variables 

X = \(T + iQ) X = X -{T-iQ) (2.22) 

with eq.(2.21) simplifying to 

' k 2 + 2u 2 C xx M* Q 2u?L xx ^l 



2u > 2 £ xx 4>o4>o k 2 + 2u 2 £ xx (p (p 



(2.23) 



The dependence on the background field is especially illuminating. The energy-momentum 
tensor is 

T pv = -F pX F u x + - A g pu F 2 (2.24) 
and in the NP basis, this implies 

T u = T pv tt = 2^1 (2.25) 

This is of course the component of the energy-momentum tensor arising in the 'null energy 
condition 1 , 

T pu et > (2.26) 

As we now see, it plays a vital role in the modified dispersion relations. 

The dispersion relation follows directly from eq.(2.23) as the requirement that the 
determinant of the matrix vanishes, or in other words, the allowed values of k 2 are the 
eigenvalues of the matrix 

■2u?C xx 4>^l ^ 2 c xx <pi4>*v 



(2.27) 



^^xxMo 1uj 2 C xx (t>o(l> , 



-8- 



k 1 



This gives 

k 2 = - 2u 2 4> <f>* (c xx ± ^/C XX C XX ) (2.28) 

that is, 

- \T^k v (c TT + Cgg ± y/(Crr-Cgg)2+4C% g ) (2.29) 

We see immediately what will later be established as a very general result, viz. that the 
modification in the light cone due to the background field is proportional to the projection 
of the energy-momentum tensor onto the null cone, i.e. the component T^k^k 11 which 
appears in the null energy condition and whose sign is therefore fixed. This dependence 
turns out to be completely general and is the origin of the universal features of light 
propagation observed, for example, in ref.[15]. 4 

Notice also that as a direct consequence of its derivation from the low- energy effective 
action, eq.(2.29) is homogeneous in k^ so there is no dispersion in this approximation. The 
result is of the general form considered above in eq.(2.9) and gives a phase velocity 

^ph(O) = 1 - (c TT + Cgg ± y/ (Crr - Cgg) 2 + 4£ 2 rg ) (2.30) 

The corresponding polarisation vectors are the solutions for a, (3 in eq.(2.21), i.e. the 
eigenvectors of (2.27). This gives 

< = i(eV + e-V) 
+ V2 

= - -^=( e ie m» - e- ie fh^) (2.31) 

where the phase is defined from (/)q\/Cxx = re 1 ® . These combinations a± of the left and 
right circular polarisations represent orthogonal linear polarisations, with the direction 
fixed by the angle 6. The situation is therefore very similar to the case of the polarisation 
eigenstates for propagation in a gravitational background considered, e.g. in ref.[8]. 

Now, as explained above, in order to have subluminal propagation and v p h < 1, we re- 
quire k 2 < 0. Given the null energy condition T^k^k" > 0, it follows from eqs.(2.28),(2.29) 
that this is ensured if 




-xx 



> (2.32) 



4 In fact, this analysis sharpens the conclusion of ref.[15] by making clear that it is the null energy projec- 
tion T^k^k" which controls the dispersion relation rather than the energy density itself. This observation 
clarifies a number of other results in the literature. For example, in their paper [16] discussing the possibility 
of an experimental observation of electromagnetic birefringence, Heinzl et al. [16] relate the low-frequency 
limit of the refractive index to 'an energy density Q 2 ' where 

Q 2 = E 2 + B 2 - 2S.k- (E.k) 2 - (B.k) 2 

and S is the Poynting vector. From the identities in the appendix, it is easy to check in our conventions 
that Q 2 is indeed just 2 T^t . (See also ref.[17].) 
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or equivalently, 

(C-FF &FQ ' 
| > 0, C^ + Cgg > (2.33) 

Cjrg Cgg 

These are the constraints placed on the coefficients of the leading irrelevant operators in a 
general low-energy effective action by the requirement of no superluminal propagation, at 
least (see section 3) the requirement that f p h(0) < 1. 

If we specialise to the Euler-Heisenberg action (2.2), we can see how this constraint is 
realised in low-energy effective QCD. We find 

4 a 2 4 a 2 

£-FF = -T7— Cl, £gg = -[7— c 2, C FG = (2.34) 

45 m 4 45 mr 

and the positivity constraint reduces to simply c\ > 0, C2 > 0. Of course this is satisfied 
by the QED values c\ = 4, C2 = 7. 

The new light cone condition for the Euler-Heisenberg action is simply 

fc 2 + 44v^" [ 4 +-7-] = (2.35) 
45 mr 

corresponding to the linear polarisations a^,a^ respectively. This reproduces the results 
found in our earlier paper [5] where the dependence on the null energy component T^k^k" 
was first recognised. It was observed there that the positivity of c\ and C2 also ensured the 
positivity of the trace of the energy-momentum tensor, but subsequent work [18] has shown 
that this apparent link between causality and the conformal anomaly is merely a low-order 
artifact. It also reproduces the classic result for photon propagation in a purely magnetic 
background field [19, 20]. In this case, it is straightforward to check that the phase angle 
entering the formula for the polarisation eigenstates is just tan# = B1/B2 (where we take 
the direction of propagation to be the 3-axis), so that lies parallel to B while a+ is 
orthogonal. 

To summarise, working in low-energy electrodynamics, we have seen how the light 
cone condition is modified by the component of the energy-momentum tensor which is 
fixed by the null energy condition. The eigenstates of definite phase velocity are linear 
polarisations with direction fixed by the background field. Ensuring v p ^ < 1 imposes 
positivity constraints on the coefficients of the leading irrelevant operators in the effective 
action. The modified light cone condition is homogeneous and quadratic in k and there is 
no dispersion. 

However, the absence of dispersion is merely an artifact of the low-energy approxima- 
tion. In general, there is non-trivial dispersion and the light cone condition (2.29) only 
fixes the low-frequency limit v p h(0) of the phase velocity. In the following section, we dis- 
cuss more carefully what causality actually requires and find that, far from imposing any 
constraint on the low-frequency behaviour of the phase velocity, it imposes a constraint 
on the high-frequency limit, iv,h(oo) < 1. In turn, this is controlled not by the low-energy 
effective action but by the UV limit of the quantum field theory. 
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3. Speeds of light, causality and dispersion relations 

In order to understand whether causality does indeed impose constraints on the parameters 
of a low-energy effective action, we first need to understand more precisely exactly what 
we mean by the 'speed of light'. 

By definition, a low-energy effective action is an expansion in powers of derivatives 
acting on the fields. For example, the Euler-Heisenberg effective action (2.1) contains 
only terms of 0(F 4 /m 4 ) leading, as we have seen, to a homogeneous dispersion relation for 
photon propagation with each term of 0(k 2 ). However, this neglects higher derivative terms 
beginning with 0(D 2 F 4 /m 6 ), which modify the dispersion relation with terms suppressed 
by 0(k 2 /m 2 ). This is why the 'speed of light' derived in section 2 from the low-energy 
effective action is just the low-frequency limit of the phase velocity, i.e. u p h(w)|w->o- 

In fact, there are many 'speeds of light', of which the phase velocity is just one. A 
comprehensive account is given in the classic text by Brillouin [21], which considers in 
detail the propagastion of a sharp-fronted wave pulse in a medium with a single absorption 
band (see below). The essential ideas are explained in our review [8]. As well as the 
phase velocity Vp^uj) = pr, we need to consider the group velocity v gp (uj) = jj^j- and the 
wavefront velocity, i.e. the velocity at which the boundary between the regions of zero and 
non-zero disturbance of a wave pulse propagates. This definition of wavefront is identified 
with the characteristics of the partial differential equation governing the wave propagation 
and it is the wavefront velocity v w f is the speed of light which is relevant for causality. 

Other definitions of the 'speed of light', each of which is useful in a particular context, 
can be found in ref.[8], e.g. signal velocity, energy-transfer velocity, etc. In addition, as we 
have seen in section 2, the ray velocity also plays a conceptually important role. 

We now show how the wavefront velocity is related to the phase velocity. The following 
proof follows a paper by Leontovich [22] , as summarised in ref. [8] . We show that for a very 
general set of PDEs the wavefront velocity associated with the characteristics is identical 
to the k — > oo limit of the phase velocity, i.e. 

v wi = lim ^- = lim v ph (u) (3.1) 

fe^OO |fc| <J— >00 

The first step is to recognise that any second order PDE can be written as a system of 
first order PDEs by considering the first derivatives of the field as independent variables. 
Thus, if we consider a general second order wave equation for a field u(t,x), in just one 
space dimension for simplicity, the system of PDEs we need to solve is 

"ij^+bij^ + Cirfj = (3.2) 

where & = 

Making the 'geometric optics' ansatz 

4>i = <Pi exp i(wt — kx) (3-3) 

where the frequency-dependent phase velocity is v p h(k) = uj{k)/k, and substituting into 
eq.(3.2) we find 

(^iuoaij — ikbij + Cij^jifj = (3-4) 
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The condition for a solution, 



det 



aijV ph (k) 



i 



= 



(3.5) 



then determines the phase velocity. 

On the other hand, we need to find the characteristics of eq.(3.2), i.e. curves C on which 
Cauchy's theorem breaks down and the evolution is not uniquely determined by the initial 
data on C. The derivatives of the field may be discontinuous across the characteristics 
and these curves are associated with the wavefronts for the propagation of a sharp-fronted 
pulse. The corresponding light rays are the 'bicharacteristics'. 

We therefore consider a characteristic curve C in the (t, x) plane separating regions 
where fa = (ahead of the wavefront) from ^ / (behind the wavefront). At a fixed 
point (to,xo) on C, the absolute derivative of fa along the curve, parametrised as x(t), is 
just 

dfa d(f>i 



dt 



dt 



o dx 



dx 
o dt 



(3.6) 



where dx/dt = v w { gives the wavefront velocity. Using this to eliminate -Jjf- from the PDE 
eq.(3.2) at (to,^o)> we find 



dx 



+ a 



dt 



+ c 



do) 



ijYj 







(3.7) 



Now since C is a wavefront, on one side of which fa vanishes identically, the second two 
terms above must be zero. The condition for the remaining equation to have a solution is 
simply 

det a,ijV w f — bij = (3-8) 

which determines the wavefront velocity v w f. The proof is now evident. Comparing 
eqs.(3.5) and (3.8), we clearly identify 



Vwt = v ph \ k . 



(3.9) 



The conclusion is that the wavefront velocity is in fact the high-frequency limit of 
the phase velocity. That is, the 'speed of light' which is relevant for causality is not the 
low- frequency phase velocity f p h(0) but its high-frequency limit t> p h(oo). 

In flat spacetime, it is therefore natural, and indeed true, that causality requires 
Uph(oo) < 1. In curved spacetime, however, the requirement of causality is much more 
subtle. Indeed, as explained in ref.[8] (see also ref.[12, 23]), it is in principle possible to 
maintain a causal theory even with v p h(oo) > 1 provided that the background spacetime 
exhibits a modified version of 'stable causality'. This possible loophole is most easily il- 
lustrated for the case considered in section 2 where the apparently superluminal wave 
equation is homogeneous and quadratic, although the physical idea remains relevant for 
general propagation equations. In this case, the dispersion relation may be written as 
G^k^ku = 0. As explained in section 2, the effective light cone is then determined by an 
'effective metric' G^ Ul where G = which depends on the background fields and which 
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we think of as being perturbatively close to the spacetime metric g^ v . This generalises the 
standard general relativity assumption that light follows null geodesies, in which case the 
dispersion relation is just g^ u k^k v = 0. 

The concept of stable causality is summarised in the following definition and theorem 



• A spacetime manifold (Ai,g^ u ) is stably causal if the metric g^ u has an open neigh- 
bourhood such that A4 has no closed timelike or null curves with respect to any metric 
belonging to that neighbourhood. 

• Stable causality holds everywhere on M if and only if there is a globally defined function 
/ whose gradient D^f is everywhere non-zero and timelike with respect to g^ u . 

According to this theorem, the absence of causality violation in the form of closed 
timelike or lightlike curves is assured if we can find a globally defined function / whose 
gradient is timelike with respect to the effective metric G^ u for light propagation. / then 
acts as a global time coordinate. That this can occur is demonstrated for the example of 
'superluminal' light propagation in a Friedmann- Robertson- Walker spacetime in ref.[8]. 

Setting this curved-spacetime subtlety aside for the moment, our conclusion here is 
that the restriction imposed by causality on the propagation of light is not concerned with 
the low-frequency, IR limit w p h(0) determined by the low-energy effective action. Rather, 
causality imposes a bound on the high-frequency limit, i> p h(oo) < 1. This is determined by 
the UV nature of the quantum field theory. 

At first sight, therefore, it appears that causality imposes no constraint on the struc- 
ture of low-energy effective theories. However, there is one further twist. The (complex) 
refractive index n(uj) satisfies a Kramers-Kronig (KK) dispersion relation 5 



In a conventional dispersion relation, the imaginary part of the refractive index, i.e. the 
absorption coefficient, is positive, Im n(uj) > 0. This implies that n(oo) < n(0) and so, 
recalling Vp^uj) = 1/Re u(lo), that t> p h(oo) > v p h(0). If so, the phase velocity deduced 
from the low-energy effective action would be a lower bound on the wavefront velocity 
v w { = v p h(co) and so a superluminal f p h(0) would indeed represent a violation of causality. 

Similar KK dispersion relations arise in quantum field theory applied to correlation 
functions and scattering amplitudes M(s, t). The axiomatic inputs into their derivation are 
the standard ones of local quantum field theory, notably micro-causality, viz. the vanishing 
of commutators of field operators evaluated at spacelike separated points. In conventional 
QFT, the optical theorem relates the imaginary part Im _M(s,0) of a forward scattering 
amplitude via unitarity to the total cross section, so positivity is assured. 

It therefore appears that positivity of Im n(co), and consequently -u p h(oo) > f p h(0), is 
guaranteed in any situation of interest. This would ensure the validity of causality bounds 
derived from the low-energy effective action. However, this conclusion may be too fast. For 

5 In what follows, we always refer to this as the 'KK dispersion relation' to avoid any possible confusion 
with our use so far of 'dispersion relation' to describe the dependence of the frequency u)(k) on wave- number 
k determined by the light-cone condition. We also drop the convention c = 1 in this section. 



[24]: 



n(oo) 




(3.10) 
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example, consider an optical medium with a single absorption band. Its refractive index 
may be modelled as [21] 



n 2 (w) 



1 



M 2 



2 — oj" 2 + i5uj 



(3.11) 



where M determines the absorption strength, ujq the characteristic frequency of the medium 
and 5 the width of the absorption band. The real and imaginary parts of n(u>) are sketched 
in Fig. 1. 





Figure 1: The left hand figure shows the real part of the refractive index Re n(oj) for the single 
absorption band model (3.11). The dashed line is n{ui) = 1. The right hand plot is the corresponding 
Im n(ui). 




Figure 2: The left hand figure shows the frequency dependence of the phase velocity v p h(oj) (solid 
green line) and the group velocity v gp (u>) (dashed blue line) for the single absorption band model. 
The constant wavefront velocity u w f = 1 is indicated by the long-dashed red line. The right hand 
figure shows the corresponding indices, n p h = Re n(u) = l/v p h and n gp (uj) = \/v gp . 

This is the standard situation. The imaginary part Im n{u) is positive, while Re n(co) 
initially rises ('normal dispersion') then falls rapidly through the absorption band ('anoma- 
lous dispersion') passing through Re n = 1 at the characteristic frequency u)q before asymp- 
totically approaching n(oo) = 1 from below. In terms of the phase velocity, this gives 
-u ph(0) < 1 an d fph(oo) = v w f = 1, with an interesting behaviour in the vicinity of the char- 
acteristic frequency ujq. The bound u p h(oo) > w p h(0) clearly holds. In the following section, 
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we derive the explicit behaviour of v p h(u>) for QED in a background electromagnetic field 
and show how the above features are realised in a QFT context. 

It will be clear, however, that all this would be reversed if we were in a situation where 
Im n(u>) < 0. In fact, this can easily be realised in, for example, atomic physics systems 
such as we describe in section 6. It corresponds to the system exhibiting gain rather than 
absorptive scattering. The big question is whether such a phenomenon can arise in QFT. 
In section 5, we show that for QED in a gravitational background we can indeed find 
a superluminal low-frequency phase velocity v p h(0) > 1. This poses a clear dilemma in 
what is after all a very conventional QFT, albeit involving gravity: either we really find 
^ph(oo) > ^ph(O) > 1 in which case we would be forced to reassess fundamental issues 
regarding causality, or Im n(uj) < 0, i.e. propagation and scattering in a gravitational field 
can exhibit the characteristics of gain as well as dispersion. 

All this will be explored in detail in future sections. It is immediately clear, how- 
ever, that we need to be extremely careful before drawing the conclusion that apparently 
causality- violating values of couplings in low-energy effective actions necessarily imply the 
absence of a well-defined UV completion within the usual axioms of QFT, possibly includ- 
ing gravity. 

4. The QED refractive index 

In this section, we find an explicit representation of the dispersion relation for photon 
propagation in QED in a constant background electromagnetic field. This shows how 
the frequency dependence of the refractive index described above is actually realised in a 
well-understood quantum field theory. 

The field-theoretic calculations required to evaluate an effective action valid for all 
momenta in a general background field are still beyond current techniques, so here we 
restrict the discussion to constant background fields. Also, since we are concerned only 
with analysing photon propagation, it is sufficient to evaluate the vacuum polarisation 
tensor (the second functional derivative of the effective action) in the given background. 
The state of the art in such calculations are those of Schubert and Gies [11, 25, 26] using 
the worldline path integral approach to QFT. This generalises the earlier classic work on 
QED in background fields using the more conventional Schwinger proper time or heat 
kernel methods (see for example [27, 19, 20]). Here, we start from the result obtained in 
ref.[ll] for the vacuum polarisation tensor in a constant background electromagnetic field 
and translate it into the NP formalism before considering in detail its consequences for 
photon propagation, including the realisation of the KK dispersion relation. 

Our starting point is therefore the formula (eq.(4.11) of ref.[ll]) for the one-loop QED 
vacuum polarisation: 
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(4.1) 



where 

aF-bF _ bF + aF 

z + = tesa Z- = —esb Z + = — ^ Z_ = — i — ^ (4.2) 

or + o z a 2 + o 2 

with 

a = \/ VF 2 + G 2 + 7 r b = \J V^ 2 + G 2 - (4.3) 

The explicit expressions for the dynamical coefficients C a , S a p, A a p are given later. 
The dispersion relation for photon propagation is 

k 2 a u - k.ak v - = (4.4) 

Clearly, given the vacuum polarisation in the form (4.1), it is far from transparent what 
its implications are for the physics of photon propagation. However, simplifications and 
physical meaning rapidly become apparent as we translate into the null tetrad, NP form. 
In fact, it may well be that the most efficient method for calculating vacuum polarisation 
in background fields would be to adopt the NP formalism from the outset. Indeed, this 
may offer the only realistic hope of reproducing similar vacuum polarisation calculations 
for background gravitational fields (see section 5). 

4.1 General structure of the vacuum polarisation and dispersion relations 

As before, we take k^ = \^2lo£ )1 + 0(a) and polarisation vectors = am) 1 + /3m^ and look 
for the eigenvectors and eigenvalues of the dispersion relation. Contracting with m u , rh u in 
eq.(4.4), we have 

/ /•- . 1 1 II \ / a\ 

' = (4.5) 



mm ±± mm 



n mjTt k + II mm / \(3 
where H mm = Ii l _ LU m^m 1 ' etc. 

The first step is to translate the various 'kinematic' terms in the vacuum polarisation 
tensor into NP form. The necessary intermediate formulae are collected in the appendix. 
For the required contractions, we find 

7 a ~ ih A ~ 7 -r- a + ib '* 

m.Z±.i = — ^ -r (pQ m.Z±.t = =F 



a 2 + b 2 ^ , a2 + b2 ru 

LZ 2 ± 1 = ±-J_2fo$ m.Z 2 ± l = T _L_20o# fh.Zll = T _L_2^ 

(/- • />- " a- ■ b- " <i- ■ b- 

m.Zl.m = m.Zl.fh = ± — = 2d>c\d>% m.Zl.fh = fhZl.m = - ± — » » 2<hi<h1 

± ± a 2 + b 2 2 ± ± 2 a 2 + b 2 1 

(4.6) 

where we have also used the results: 



a 2 + b 2 = 2\/T 2 + Q 2 = 4VX£ 
a 2 - b 2 = IT = 4 Re (0 o </>2 - <p\) lab = 2Q = 4 Im (e/> o 2 - <t>\) (4.7) 
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This immediately begins to show the simplifications which arise by using the NP compo- 
nents of the electromagnetic field strengths. Several remarkable cancellations now occur. 
To illustrate with just two examples, we find: 



= Wo 



S ++ 

2 



1 



LZ\.l - (Z 2 + £) m (Z 2 + £) T 



(a 2 + b 2 Y 



W " (0I) 2 ) 



>++ 



1 

AX 



4>l S ++ 



(4.8) 



while 



(Z 2 + ) r - nm £.Z%.£ - (Z\l) m {Z\l\ 

\\ + a^V ^ - 



A\TXX 



^>o0o 5"++ 



20o (, 



S. 



++ 



(4.9) 



Collecting terms and substituting back into eq.(4.1), we therefore find the following 
simplified form for the relevant components of the vacuum polarisation tensor in the NP 
basis: 



z + z_ 



tanh z + tanh z. 



>(-' 



2 i j* 

expl — isuj 0o0o 



2y/~XX 



) flAB 



(4.10) 



where 



UJ 0000 
W 2 0o0o 



S ++ - S+_ + S_+ - + A++ + - - 
= (S ++ - S + _ + - 5__ + A ++ - + - 
S+- - S-+ + S"__ + A ++ + A +_ + A_+ + A_ 



2v / A 7 A 7 
1 

2X ( S++ 



111 



Notice that up to this point we have assumed nothing about the explicit form of the 
coefficients C a ,S a/ 3 and A a p. Self-consistency is guaranteed, however, by certain explicit 

properties, e.g. both A a p and S a p are symmetric on a, (3 while all S a p are real, A ++ ,A 

are real and A-\ , A |_ are pure imaginary. Imposing these properties, we therefore find in 

the m, fh sector: 



1 



-UJ 



A 



i 



xx 



0000 



B* i 0^05 



A — L - 



W 



with 



and 



B 



A 

S++ 



-- S++ - S— + A++ - A— 
2S+_ + + A ++ + 2^+_ + 



(4.12) 

(4.13) 
(4.14) 
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This matrix has eigenvalues 



^ Ml Tff ( A ± y/EjF ) (415) 

Finally, putting all this together, we find the dispersion relation for photon propagation 
in a constant background electromagnetic field, as determined by the vacuum polarisation 
tensor (4.1), is: 

k 2 ^ r ds ism 2 f l z + z_ — exp (_ is ItuvW (C+ - C-j) 

4vr J s J_! tanhz + tanhz_ F V 8 M \f?CX > 

x i TuvkW — (A± v#)] = (4.16) 
corresponding (see section 2) to linear polarisation vectors 

4 = ^(e ie m + e'^rnj 

--j=(e i6 m - e- w mj (4.17) 



with the phase defined by <Aoy x = re * 9 - 

The most remarkable feature of this result is the appearance of the null energy com- 
bination Tuvk^k" = Auj 2 cf)Q(j)Q, both as an overall factor just as in the low-frequency limit 
and also in the exponent which, as we shall see, controls the high-frequency limit. This 
is an exceptionally clear demonstration of the direct relation, valid for all frequencies or 
momenta, between the null energy condition and the presence or absence of superluminal 
propagation of light. 

4.2 Dynamics and the frequency dependence of the refractive index 

Having established that the relation between photon propagation and the null energy con- 
dition remains valid for all frequencies, the next step is to investigate the detailed dynamics 
of the dispersion relation. This is encoded in the coefficients C a ,S a /3 and A a p, which are 
functions only of the two Lorentz invariant combinations of the background electromagnetic 
field strengths, T and Q. 

The required formulae can be extracted from the results derived in ref.fll]. We find 



1 / coshz„f 



C a = coth z c 

z a \ smhz a 

cosh z a v cosh zpv sinh z a v sinh zpv 
Q/3 cosh,z Q cosh sinhz a s'mhzp 

/coshz Q v , \ / \ sinh z a v sinh zgv 

A a p = - — - -cothz a + tanhz a )(a^(3) + -2 -2- 4.18 

\ sinh z a / V / cosn z a cosn z /3 
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Expanding these in a weak field expansion, we find 

I.. o. /. 1 



c a = i(i-- 2 )(i-^(i-^ + --.) 



A af} = -l(l- v 2 f ZaZf} + ... (4.19) 



Recalling z + = iesa and z_ = — esb, and noting a + ib = 2\/X, we verify that the com- 
binations of the S a/ 3 and A a g occurring in eqs.(4.13) and (4.14) have precisely the correct 
X, X dependence to cancel the inverse field strength factors. Obviously this must happen 
to ensure that the dispersion relation involves only positive powers of the background field 
strengths, but it is a very non-trivial consistency check. For the coefficients A and B we 
find explicitly 

A = -e 2 s 2 VxX~ (1 -« 2 )(3- \v 2 ) + ... 

B = e 2 s 2 X (1 - v 2 ) 2 + ... (4.20) 



while 



C+-C- = e 2 s 2 ^(l-v 2 ) 2 v f XX (4.21) 



While we could continue this analysis for a general constant background field with 
arbitrary T and Q, since our main interest is in the frequency dependence of the dispersion 
relation we now simplify to the important special case of a pure magnetic field. 

In this case, T = \B 2 , Q = so a = V2J 7 , b = and z + = ies^lT ', z_ = 0. It is 
convenient to introduce the notation z = —iz + = esB, with the hyperbolic functions of 
z + becoming trigonometric functions of z. Then, since B is real, the eigenvalues of the 



"The dictionary between our notation and that of ref.[ll] is: 
C a = (A B12 — ^sn) 

Zoc 

Oaf} — JB12 J B12 ~ °F12 J F12 



la/3 



(A a B12 - A a B11 + A a F11 )(A B12 - A B22 + A 







+ A F i 2 A F12 



where the worldline formalism coefficient functions are: 

a _ sinh2 a « a coshz a « 

&B12 — —r~, &F12 — <-rF12 : 

sinh z a cosh z a 

a coshz Q t> 1 „ smhzaV 

Ab\2 = — —. A F12 = Gf12 : 

sinn z a z a cosh Za 

A B ii = A B2 2 = cothz^ — - A F11 = A F2 2 = tanhza 

z 



where G F12 — 1. 



dispersion relation simply involve the combinations: 



\, , „. COSZV VSmZV COSZf — COSZ 1, 9 , , V 2 . 9 <. 

-OA + fi) = : 2 — = --(l-v 2 )(l + —)z 2 + Oz 4 

2 cos z sin 2 cos z sin 2 z 4 3 

1 . . , 9 . coszv wsinzf 1. 9w w 2 , 9 ^, 

-(.4 - B) = (1 - O 2 ) - — + — = - jd - - y)2 + O(^) (4.22) 

while the exponent involves 

= -^(l-. 2 ) + COS 7 s .; z C ° SZ = ^(l-, 2 )V + 0(z 4 ) (4.23) 



Collecting everything, we find that the dispersion relation (4.16) for a pure magnetic 
background field becomes: 



4vr y s 7_! 



1 



sV^u; 2 — (A±£)J = (4.24) 



This is our final analytic result for the dispersion relation. We highlight three features. 
First, notice how the null energy combination T^ v i^l u automatically projects out the rel- 
evant component of the background field, in this case the component orthogonal to the 
direction of propagation, i.e. Bj_ = B\ + B 2 . 

Next, note that the angle 6 in eq.(4.17) defining the direction of linear polarisation 
for the eigenvalues of k 2 is given in this case by tan# = — f° = J|k This shows that 
the eigenvectors a± correspond to polarisation perpendicular (+) and parallel (— ) to the 
projection of the B field in the plane orthogonal to the direction of propagation. 

Finally, as we now show explicitly, the presence of the factor oj 2 in the exponent in 
eq.(4.24) (inherited from the null energy term T^ u k^k v ) means that for high frequencies 
the exponent will oscillate rapidly and in fact drive the whole integral to zero. This is 
the mechanism which ensures that in the high frequency limit the QED refractive index 
becomes unity, ensuring i> w f = 1. 

To see this in detail, consider eq.(4.24) in the weak field approximation. The appro- 
priate small dimensionless parameter is eB±/m 2 . The dispersion relation reduces to 



k — -r- 



eB ± 



8ir \ m 2 



dv (1 - v 2 ) < 



;i-t) 



f°dtt ew(-*\t + hl-v 2 ) 2 ( e -^)%t 3 }) = 0(4.25) 
Jo \ L 48 V rrr / m z J / 



7 These expressions can be directly compared with the classic work on photon propagation in a magnetic 
field by Tsai and Erber [19, 20]. The equivalences are: \{A + B) -> -*HL£/V_l and \(A-B)^> -^iVy, 
eqs.(49) and (48) of ref.[19, 20] respectively. The exponent also reproduces the equivalent expression in 
ref.[19, 20]. 
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for the | | eigenvalues respectively. The corresponding result for the weak-field QED 
refractive index is therefore 



n{u)) = 1 — 



a 

16tt 



eB± ^j j\v(l-v 2 )< 



m 



f 

Jo 



dt t exp 



,2\2 



(4.26) 



The (inverse) phase velocity and the absorption coefficient are determined by the real and 
imaginary parts of n(uj). The important observation is that the term in the exponent pro- 
portional to i 3 is vital for the high frequency behaviour of n{uj). Although it is suppressed 
by the weak-field parameter eB±/m 2 , it cannot be dropped because it is accompanied by 
the arbitrarily large uj 2 /m 2 factor. This is why it is necessary to evaluate this exponential 
contribution to the vacuum polarisation, rather than restrict to a simple power series ex- 
pansion in the background field. Indeed, this is one of the major challenges in extending 
this analysis to background gravitational fields. 

In the low-frequency limit, the t integral is purely real and reduces to j Q dt t e~' lt = — 1. 
This leaves 



n(0) 



a ( eB± , 

1 + E^b?> |4+ ' 7 - 



(4.27) 



in agreement with the result (2.35) derived from the Euler-Heisenberg low-energy effective 
action. 

In order to map out the full frequency dependence of the refractive index, we need the 
following special functions defined by Airy integrals: 8 



and 



dt t cos 



dt t sin 



x 



( * 
( t + J 



x 



hi) 



(4.28) 
(4.29) 



3 We follow the notation of ref.[28]. Gi(x), defined via the integral 



Gi(x) 



i r 

k Jo 



dt sin xt + 



can be expressed in terms of the Airy functions Ai(x) and Bi(x), or alternatively hypergeometrics, as 
follows: 



Gi(x) = -Bi(x) + 
o 



F 

Jo 



dt 



Ai(x)Bi{t) - Ai(t)Bi(x) 



2 5 7 4^ 
' 3 ' 6 ' 6 ' 3 ' 1296 " 



1 B _, /, 7 4 5 11 1 

z ii*4 1; — ,— ,— , — ; * 

40tt V 6 3 3 6 1296 



(Note the typographical error in ref.[28] where the factor 'z 5 ' is omitted.) The prefactor in our definition 



(4.28) of G(x) compensates the asymptotic behaviour Gi'(x) 



+ . . . as x — ► co. In eqs.(4.30) and 



(4.31) the change of variable u = (1 — v ) has been made to simplify the final formulae. 
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Here, we have defined G(x) = —irx 2 -^Gi(x), where the function Gi{x) is related to the 
standard Airy functions. 



After some further simplification, and introducing the rescaled frequency f2 



3 



we find the following results for the real and imaginary parts of the refractive index: 

»■»(") - ^i^fhl! d "" {1 -" ri {,! *!} G ((i" fi " 

(4.30) 

and 

2 f n — — ) 



3 



(4.31) 



Re n 
1 . lr 




2 4 6 8 10 



Figure 3: This figure shows the real part of the QED refractive index Re n(f2) for a weak back- 
ground magnetic field. The horizontal axis In f2 measures frequency on a logarithmic scale, with 
f2 = ^fj-^;. The magnified vertical scale corresponds to setting ^(-^3-) = 1 in eq.(4.30). The 
solid (dotted) line shows the refractive index for the polarisation a+ (a^) orthogonal (parallel) to 
the magnetic field. 



These expressions are evaluated numerically and the results plotted in Figs. 3,4,5. The 
refractive index for QED shows the same essential features as the single absorption band 
model described in section 3. The absorption coefficient Im n{Q) due to pair creation is 
positive and shows the expected single peak. The real part Re n(£l) initially rises from the 

values (4.27) then falls away gradually to just below 1 before approaching the asymptotic 

_ 4 

value 1 from below, with frequency dependence f2 3 . In terms of the phase velocity 
w ph(w) = 1/Re n(u), the low-frequency limit is subluminal, while asymptotically v p h(tu) 
approaches v w { = 1 from the superluminal side. 

It is worth noting that the absorption band in QED is very broad. This means that 
there is an appreciable difference between the frequency rj max ~ 2.5 defining the maximum 
of the absorption coefficient and the frequency — 20 where (with the definition following 
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Figure 4: The absorption coefficient Im n(f2) plotted against lnf2, corresponding to Fig. 3. 




. 99925 : 
0. 999 - 



the simple model (3.11)) the refractive index passes through 1. The fall-off of Re n(Cl) is 
slow and it has only a very shallow dip into the 'superluminal' regime. (These effects 
are partially masked by the use of the log scale for in the figures.) The corresponding 
frequencies w max and ujq are inversely proportional to the background field B± . 

We now comment briefly on the experimental Re n 

implications of these results. The most striking l.ooost \ 
phenomenon is 'vacuum electromagnetic birefrin- 100025 ,\ 
gence'. A general linear polarisation is a superpo- 
sition of the eigenstates and which prop- 
agate with different phase velocities. After prop- 
agating through some distance, this velocity dif- 
ference introduces a phase difference between the 
eigenstates, and the resulting wave then describes 
elliptic polarisation with the principal axis of the 
ellipse rotated relative to the initial direction of 
linear polarisation. 9 

Experiments to look for vacuum electromagnetic birefringence are currently being 
planned [16, 17] using either an X-ray free-electron laser or a Thomson backscattering 
source to provide a high-frequency polarised pulse to scatter from a high-intensity laser 
background. Even for such high frequencies, however, we still only have ^ ~ 10 -2 and the 
attainable background fields from petawatt lasers with intensity around 10 22 Wcm -2 cor- 
respond to eB±/m 2 ~ 10 -4 . This means that currently envisageable experiments will only 
be sensitive to the low- frequency sector of the weak-field QED refractive index in Fig. 3, 
i.e. the region of normal dispersion (rising Re n(oj)). The anomalous dispersion (falling 
Re n{u))) region correlated with the peak of the absorption coefficient Im n{u)) remains 
beyond the direct reach of current technology. 

9 The PVLAS collaboration [29, 30] has recently reported the observation of a rotation of the polarisation 
plane for linearly polarised light in a 5.5T magnetic field using their high-sensitivity optical ellipsometer. 
The origin of this observation is not yet understood. The vacuum electromagnetic birefringence described 
here is negligible in this system, prompting the speculation that the effect may be due to the existence of 
a new ultra-light axion-like particle with mass in the region of 10 eV. 



Figure 5: This figure shows a magnifica- 
tion of the region of the refractive index 
curve where Re n(tt) falls just below 1. 
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To conclude this section, we have shown in explicit detail how the KK dispersion 
relation (3.10) controls the relation between the IR and UV limits of the phase velocity in 
a well-understood quantum field theory. The absorption coefficient Im n(u>) due to pair 
creation is positive, in accordance with expectations from the optical theorem, ensuring that 
the low- frequency phase velocity i> p h(0) is less than its UV limit u p h(oo) = v w f. Causality 
requires i> w f < 1 and as expected we find that v w f = 1 in QED, a result which arises 
technically through the appearance of a rapidly-oscillating exponential factor in the vacuum 
polarisation. It therefore follows that in QED there is indeed a causality bound f p h(0) < 1. 
As we have seen, the IR limit of the phase velocity is determined by a combination of 
the null projection of the energy momentum tensor and the coefficients of the leading 
irrelevant operators in the low-energy (Euler-Heisenberg) effective action. It finally follows 
that, under the assumption that the null energy condition T^ v k^k v > holds, causality 
does impose the positivity bound (2.33) on these parameters. 

5. Superluminality and QED in gravitational fields 

We now consider photon propagation in a background gravitational field, taking into ac- 
count the quantum corrections induced by vacuum polarisation. Remarkably, in this case 
the low-frequency phase velocity f p h(0) can become superluminal. 

The low-energy effective action for QED in a background gravitational field (the ana- 
logue of the Euler-Heisenberg action) was first calculated by Drummond and Hathrell in 
the original paper [4] in which they discovered the possibility of vacuum-polarisation in- 
duced superluminal propagation. Integrating out the electron field, the one-loop effective 
action is 10 



The coefficients of the leading irrelevant operators of type 1ZFF are a = — 5, b = 26 and 



To study photon propagation with this effective action, we again use the geometric op- 
tics methods described in section 2, making the additional assumption that the background 
field is slowly- varying on scales comparable to the wavelength of light. In contrast to the 
electromagnetic backgrounds, we cannot restrict to constant gravitational fields since for 
a spacetime of constant curvature 11 the only independent component is the Ricci scalar, 

1() There is a further term, dD^F^ D V F V \ with d — —24, but this only affects the dispersion relation 
at 0(ak 2 ) ~ 0(a 2 ) since its contribution to the Maxwell equation involves the equation of motion factor 
D^F^ which is itself already of O(a). The Ricci scalar term also does not contribute to the dispersion 
relation at 0(a) for the same reason. 

11 A spacetime of constant curvature is characterised by 




c = -2. 




R^\p — -r^R{g^\gv P — g^g^x) 
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which does not affect the dispersion relation at 0(a). In this case, omitting the terms 
involving derivatives of the curvatures, which are suppressed by O(^) for wavelength A 
and curvature scale L, the modified Maxwell equation gives rise to the following relation 
(the analogue of eq.(2.16) for a background electromagnetic field): 

k 2 a u - k.ak v - 1 -^(2bR x Jk x k p a u -k x k u a p )-8cR u Xup k x k p a p ) =0 (5.2) 

7207T m z \' J 

We now re-express this in terms of the Ricci and Weyl tensors and introduce the NP 
formalism. In an analogous way to the electromagnetic field strengths, the ten independent 
components of the Weyl tensor are represented by five complex scalars $j (i = 0, . . .4), 
where [31] 

$o = -C^ Xp l»w?l x m p 
*i = -C puXp l p n v l x m p 
$2 = -C puXp i p m v fh x n p 
$3 = -C puXp l p n u m x n p 

$4 = -C^n*ih v n x m p (5.3) 

The independent components of the Ricci tensor are described by four real and three 
complex scalars: 



$ 00 = —:Ru,t p l v 



2'^ 
1 

4' 
1 



$n = ~-R pv (t p n v + m p rh v ) 



1 
1 



$22 = --R pv n p n v 



$oi = $*i = --B.,^m v 

$02 = $20 
$12 = $21 



^R pv m p m v 
- X -R piV n p vn v 



A = ±R = ^R^n" - m"m") 



(5.4) 



As before, we take fe M = \^2u>£ p and the polarisation vectors a M = am 11 + /3m M . Then, 
eq.(5.2) becomes 



■k 2 + ^(b + 2c)$ 00 



90-7T m z u 



fc 2 + 90T^(^ + 2c)$ 




(5.5) 



where we have used the important identity C pvXp i p m v l x m p = 0. The eigenvalues give the 
new light cone 



k 2 + 



2 r 



90vr w? 

The polarisation eigenstates are 



(5 + 2c)$ 00 ± 2c v / $ $o 







(5.6) 



V2 



{e ie m p + e- M m p ) 



-^(e i6 m p 
V2 



(5.7) 
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with the phase defined by = l^o|e 2ie . 
The modified light cone is therefore 



k 2 + 



a 1 



(b + 2c) R^Wk" ± Ac\C^x P k^m v k x m p \ = (5.8) 



360vr m? 

corresponding to a phase velocity 

MO) = l-^^[-(b + 2c) R^t ± 4c|C'^ A /'WTW|] (5.9) 

Notice immediately that using the Einstein equation, the Ricci tensor can be re- 
expressed in terms of the energy-momentum tensor by R^ u = SirT^ (in G = 1 units) and 
the first term can be written in familiar form involving the null energy projection T^k^k". 
In the gravitational case, however, the phase velocity also has a new, polarisation-dependent 
contribution involving the Weyl curvature. The relation between superluminality and the 
null energy condition is therefore more subtle for QED in a gravitational background field. 

For Weyl-flat spacetimes, the situation is similar to the electromagnetic case. Assuming 
the null-energy condition holds, the sign of R^ u k^k u is fixed and the question of whether 
Vph(O) is superluminal is determined by the coefficients of the leading irrelevant operators 
in the low-energy effective action. Specifically, a subluminal v p h(0) < 1 requires a bound 
(6+ 2c) < 0. Remarkably, this is violated by QED in a background curved spacetime where, 
as we have seen, b = 26 and c = —2. 12 Both polarisations have the same phase velocity 
and there is no birefringence. 

For Ricci-flat spacetimes, i.e. vacuum solutions of Einstein's equation where T^ u = 0, 
eq.(5.9) shows that for some directions and polarisations, photons must propagate with 
a superluminal w p h(0) > 1 independent of the values of the low-energy couplings. If one 
polarisation is subluminal, the other is necessarily superluminal, satisfying the polarisation 
sum rule ^ + _ Sv p h(0) = 0. The theory exhibits vacuum gravitational birefringence. 

We illustrate this with two examples. First, consider a Friedmann-Robertson- Walker 
(FRW) spacetime [4, 8]. This has vanishing Weyl tensor, is spatially isotropic, and has 
Ricci curvature corresponding to the energy-momentum tensor = (p + P)ej 1 e t u — Pg^v, 
where p is the energy density, P is the pressure and specifies the time direction in 
a comoving orthonormal frame. The speed of light is the same in all directions and is 
polarisation independent. Eqs.(5.8), (5.9) give 

k " = 2 ~k—2 ( 5 - 10 ) 
45 m z 

and 

MO) = 1 + ^ (P + P ) 

This confirms the surprising result that the gravitational effect of a positive null energy 
projection T^k^k^ is to increase the phase velocity, resulting in superluminal propagation 
in Weyl-flat spacetimes. 



12 Note that i(6 + 2c) = 11, revealing the universal factor 11 first identified by Latorre et al. in ref.[15] as 
characterising 'energy density' effects on the speed of light. Note that the polarisation sum ^ + _ v p h(0) for 
electromagnetic fields involves the coefficients 4 + 7 = 11. This universality arises because of the common 
dependence on the one-electron-loop vacuum polarisation. 
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As an example of a Ricci-flat background with non-vanishing Weyl tensor, consider 
Schwarzschild spacetime. In the NP formalism, a standard choice of null tetrad £^,n^, m^^fh^ 
is made which reflects the properties of the null geodesies. Specifically, in terms of a con- 
ventional orthonormal tetrad (ef , , , e^), we assign 

l» =^(r 2 ,A,0,0) 

= -^(0, 0, 1, i cosec#) (5.12) 

where A = r 2 — 2Mr. We restrict ourselves to planar motion, so without loss of generality 
set 6 = tt/2 from now on. 

To consider modifications to the principal null geodesies (radial directed) we choose 
k^ = \f2u)l^. The derivation of the new light cones follows precisely eqs.(5.5) to (5.7) and 
we find 

k 2 ± -^^c |*o| = (5.13) 
457T m £ 

Now, it is a crucial feature of Schwarzschild spacetime (and indeed a large class of familiar 
black hole spacetimes such as Kerr) that it is of Petrov type D and the only non- vanishing 
NP Weyl scalar is VP 2 = —M/r 3 . This is a consequence of the Goldberg-Sachs theorem and 
reflects the shear-free character of the principal null geodesies. It follows immediately that 
for radial motion, the light cone k 2 = remains unchanged and f p h(0) = 1. 

Next, consider the modified null geodesies in the r, <f> plane. Expressing the tangent 
vector = ^7- in terms of the NP basis gives 13 

L " = (X' F '°'S) = \(l + F)£»+ r ^(l-F)n»-^=°(m»-fnn (5.14) 



13 The Schwarzschild metric is 



2 /\ 2 ^" 2 2/2 2 2\ 

ds = —rdt — dr — r (dO + sin 8d<h ) 



The t, 4> geodesic equations in the 9 = \ plane are 



d 2 t /A' 2\ dt dr _ Q d^ + 2#dr _ Q 

ds 2 \ A r I ds ds ds 2 r ds ds 



and integrate to give 



d£_r_ d4 _ \_ D 

ds A ds r 2 



Substituting into the metric, we then find that for a null interval 

dr _ ( 1 _£ 2 A\^ 



ds 

The stated result for L M = follows immediately. The radial geodesic is the special case where the 
impact parameter vanishes, D = 0. 
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where D is the impact parameter and we have abbreviated F = ^1 — ^p^j 2 ■ A convenient 
choice for the orthogonal circular polarisation vectors is 



= -±-(p,£*,i,-F) = ^^-n») + l -(l + F) m »+ l -(l-F)fn» (5.15) 

where L.M = 0, M.M = —1. Now set fc M = \/2ujL^ and take the polarisation eigenstates 
as = aM^ + j3M^. As in eq.(5.5) we find 



(5.16) 



~9^F^ 2c C LMLM k -g^^cC^LM/ V/ 3 , 

where Clmlm = C pv \ p L p M v L x M p etc. The Weyl tensor for Schwarzschild may be written 



as 



C pv \ p = - [m p m u m x m p ] + [£ p m u n x m p ] + [£^m u n x m p ]j (5.17) 

where the notation [. . .] denotes a sum over permutations with the symmetries of C pvXp . 
Evaluating the required components, we readily find 



3 



c lmlm = C L mlm = C LMLM = - D ^ 2 (5.18) 



It follows that the new light cone is 

907r m 2 r 5 



k 2 ± -2c — D 2 = (5.19) 



that is 

2a 1 3M 9 , 

Uph = 1 ± ^ — ^ 5 - 20 

457T m" 1 r & 

restoring c = —2. Since ^2 is real, the corresponding polarisation eigenstates are simply 

v2 

a M = - -^=(M M - M M ) (5.21) 
v2 

is the linear polarisation transverse to the direction of k p in the plane of motion, while 
is the orthogonal linear polarisation along e$. This provides an important example of 
gravitational birefringence with a superluminal f p h(0). 

This brings us to the key question. If it is possible to have a superluminal phase 
velocity in the low-frequency limit, how can this be reconciled with causality, given the KK 
dispersion relation? 

First, we review briefly what is known directly about dispersion for photon propagation 
in background gravitational fields. Adapting the pioneering background field calculations 
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of Barvinsky et al. [32, 33], we constructed the QED effective action at 0(1ZFF) to all 
orders in derivatives [7, 6]. This action is 14 

J dx^g C = j dx^d -\f^F^ 

+ ^(ao RF^ U + t R^F^F\ + R^ Xp F^F x ^ 
+ -±j(al RD.F^D V F\ + &i Ra V D x F Xlx D p F pv 



to 4 V 

+ t 2 R^D»F x PD»F Xp + 6^ R pu D»D x F Xp F^ 
+ cf R^ Xp D a F°»D x F^ ) 



+ \ bl R llu D^D x F x ^D u D p FP a 
vnP 



(5.22) 

In this formula, the a^, b n , are known form factor functions of three operators, i.e. 

n 2 n 2 n 2 

at = (5.23) 
V m z m z m z / 

where the first entry (F>^) acts on the first following term (the curvature), etc. It reduces 
to the Drummond-Hathrell action (5.1) in the low-energy limit. 

This action contains all the information required to extend the dispersion relation from 
the zero-frequency limit into the low-frequency region. Extracting this from the action 
involves a number of subtleties described in full in ref.[6]. The result is an expression 
resembling eq.(5.8), viz. 



,2 a 1 
k + 



3607T m 2 



^) R^k v ± G(^) \C^ Xp k»m v k x m* 



(5.24) 



where the constant coefficients are replaced by functions of the operator k.D, which de- 
scribes the variations of the curvature along the unperturbed null geodesies. The precise 
form of the functions F and G in terms of the form factors in eq.(5.22) is given in ref.[6]. 

However, it seems unlikely that this will be sufficient to describe the high-frequency 
behaviour of the dispersion relation. An equivalent approach applied to electrodynamics, 
in which we keep all orders in derivatives but restrict to lowest order in the field strengths, 
would miss the crucial exponent factor in the vacuum polarisation. This arises naturally in 
both the heat kernel and worldline path integral approaches and it would be extraordinary 
if the gravitational case was very different. This leads us to conjecture the following form 



14 This expression is slightly simplified from the form quoted in refs.[7, 6]. Here, we have used the identity 

j dx^f~g R pv D p D v F Xp F Xp = - j dxj~g R„ V D P F Af ' D h ' F Xp + dxj~g D 2 RF Xp F Xp 

to relate two terms considered independent in refs.[7, 6]. The form factors are adapted accordingly. We have 
also omitted a term D p F px ~S oD l/ F u \ since, as explained above, it does not contribute to the dispersion 
relation at O(a). 
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for the dispersion relation for QED in a gravitational background, analogous to eqs.(4.16) 
and (4.24) for the electromagnetic case: 

= (5.25) 

Here, SI ~ where 1Z denotes some generic curvature component, and J\f(s,lZ) and 

V(s, TV) can be expanded in powers of curvatures and derivatives, with appropriate powers 
of s. Neglecting the exponent factor is only valid for small i.e. for wavelengths satisfying 

A 3 

^ S> -p-, where A c is the electron Compton wavelength and L is a typical curvature scale. 
This is too near the IR to see the interesting structure in the refractive index. 

Assuming this picture is correct, we will not be able to complete a direct evaluation of 
the full frequency dependence of the refractive index for QED in gravitational fields until 
new QFT techniques are developed which allow a calculation of the exponent contribution 
to the vacuum polarisation. The gravitational case has two main extra difficulties over pure 
electromagnetism - one is simply the plethora of indices associated with higher powers of 
curvature, but more importantly we cannot restrict to constant background fields since, 
as explained above, constant curvature spacetimes do not have the required spacetime 
anisotropy needed to modify the dispersion relation. The most promising approach would 
seem to involve formulating the heat kernel or worldline path integral methods using the 
NP formalism from the outset. For example, this would allow us to study black hole 
backgrounds in terms of only one non- vanishing NP curvature scalar ^2- Even so, the 
analysis looks far from straightforward. 

Now return to the KK dispersion relation (3.10). Recall that this predicts, under the 
usual assumption Im ti(oj) > 0, that the high-frequency limit v w { = i> p h(oo) of the phase 
velocity is greater than v p h(0). However, for QED in gravitational fields, we have seen that 
it is possible to have a superluminal low-frequency limit f p h(0) > 1. The question is, how 
can this be reconciled with causality? 

We now consider the possible resolutions in turn: 

1. It is possible that the KK dispersion relation does not hold in curved spacetime. How- 
ever, the assumptions in the derivation of eq.(3.10) and similar relations for scattering 
amplitudes are the just the fundamental axioms of QFT, especially local Lorentz invari- 
ance and micro-causality, i.e. the vanishing of commutators of local operators for spacelike 
separations. These should also apply in curved spacetime, certainly for globally hyperbolic 
spacetimes (which admit a foliation into spacelike hypersurfaces), and we have examples 
of superluminal v p h(0) in such spacetimes. Invalidity of the KK dispersion relation itself 
therefore looks unlikely. 

2. It may be that u p h(oo) is indeed perturbatively greater than 1 and the physical light cone 
lies outside the geometric one, but that nevertheless the spacetime satisfies the criteria for 
stable causality with respect to the effective metric characterising the physical light cone. 
This is the situation discussed in section 3. However, this seems unlikely. If our conjecture 
(5.25) about the form of the vacuum polarisation is correct, it seems extremely probable 
that in the high-frequency limit the presence of the factor will result in the exponent 



k 2 + — dsJ\f(s,K) exp -is(l + s 2 tt 2 V{s, TV)) 
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term oscillating rapidly and, just as in the electromagnetic case, driving the entire integral 
to zero. The QFT evidence appears to point to i> p h(oo) = 1. 

3. The potential paradox would be resolved if it is possible to have Im n(u>) < in gravity. 
At first sight, this appears to be ruled out by the usual QFT identification of the imaginary 
part of a forward scattering amplitude with the total cross-section via the optical theorem. 
Indeed, the imaginary part of the vacuum polarisation should be related, just as in the case 
of a background electromagnetic field, to e + e~ pair creation, which is certainly absorptive. 
Another important physical process, photon splitting in a background field [34, 35] is also 
associated with Im n(co) positive. 

It is hard to see a way out. One line of thinking, as we explore further in the next 
section, is that a negative Im n(cj) corresponds to gain, i.e. an amplification of the ampli- 
tude and intensity of the light wave, rather than absorption. This may give us some hope 
that gravity is special, due to its ability to focus (see also ref . [36] ) . Recalling the geometric 
optics equation (2.6) for the variation of the amplitude A along a geodesic, we readily 
see that including the 0(a) contribution from the vacuum polarisation in the low-energy 
effective action (5.1) gives schematically 

k^D^lnA) = — -= uj 9 + a k^D^K (5.26) 
v2 

The optical scalar 9 = l^-n is the expansion coefficient (we have set k^ 1 = \/2ujI^ here), 
so for focusing geodesies (9 < 0) the amplitude is increased. This is a purely classical 
effect. However, there is an additional O(o) quantum contribution depending on the change 
in the curvature along a geodesic. This arises from the vacuum polarisation and will in 
general show a frequency dependence. The expansion scalar 9 and associated Raychaudhuri 
equation equally show 0(a) corrections [37]. It seems natural that the O(a) corrections 
to the geodesies induced by the vacuum polarisation should be able to enhance focusing 
as well as divergence, corresponding to amplification as well as attenuation. This would 
suggest that a negative Im n(oj) may after all be a viable possibility in gravity. 

Overall, however, the situation remains puzzling. Obviously what is needed to replace 
speculation by hard physics is a direct calculation of the momentum dependence of the 
vacuum polarisation for QED in a curved spacetime. This challenging technical problem 
remains open. 

6. Laser-atom interactions in A-systems: EIT and Raman gain lines 

In this section, we develop some intuition on how it may be possible to have u p h(oo) < 
"Wph(O) and Im n(oj) < by studying light propagation in an atomic physics context. Specif- 
ically, we consider so-called 'A-systems', illustrated in Fig. 6. 

Here, we have three atomic energy levels together with two lasers with frequencies 
uja and lob tuned nearly, but not exactly, to the frequency differences W31 = LU3 — to\ and 

= L03 — L02 of the levels. One of these lasers will be the 'coupling' (or 'pump') field 
while the other will be a 'probe' beam whose propagation through the coupled laser-atom 
system we wish to study. Experimentally, the atomic levels are typically realised by suitable 
hyperfine levels in ultracold gases of alkali metal atoms, Na, Rb, Cs. 
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The principal application of this set-up is 
to electromagnetically-induced transparency 
(EIT). For a review, see [38]. This allows the 
well-publicised phenomenon of 'slow light'. In 
this case, laser B is a strong, coupling field 
and A is the probe. This gives rise to a dou- 
ble absorption line with an intermediate re- 
gion where Im n(w) = (transparency) and 
the refractive index is linear and rapidly ris- 
ing (normal dispersion), giving rise to an ex- 
tremely small group velocity for the probe 
light. This is the physics behind the slow 
light experiments. These have achieved group 
velocities as low as a few ms _1 in an ultra- 
cold gas of Na atoms [39]. Subsequent exper- 
iments [40, 41] have even demonstrated the 
possibility of stopping and reconstructing a light pulse in a A-system in either ultracold 
Na or warm Rb vapour cells. 

For our purposes, we are more concerned with the opposite case where A is the coupling 
(or Raman pump) field while B is the probe laser. Here, we find a single Raman gain line, 
i.e. Im n(u) < 0. This demonstrates how it is possible to achieve the desired condition 
^ph(oo) < Vph(0). Although not directly relevant to our discussion here, this also allows the 
remarkable phenomenon of a negative group velocity. To achieve this, we introduce two 
coupling fields so that we have a gain doublet with an intermediate region characterised by 
virtual transparency with a strong linear anomalous dispersion. If the slope of Re n{io) is 
sufficiently large and negative, this gives a negative group velocity - a light pulse traversing 
a cell with this coupled laser-atom system is advanced sufficiently that its peak leaves the 
cell before entering it. Although certainly counter-intuitive, this astonishing effect is not 
at all incompatible with causality and has been demonstrated experimentally by Wang et 
al. [14, 42, 43] in A-systems in warm atomic Cs gas. 

We now present a unified analysis of A-systems which allows us to describe both the 
EIT and Raman gain scenarios in a common framework. EIT is obtained from our general 
formulae in one limit, while the complementary limit describes Raman gain lines. 

It is most convenient to use a density matrix formulism. Writing the wave function for 
the 3-state A-system shown in Fig. 6 as 

\m) = ci(t) e -^*|l) + c 2 (i)e-^|2) + c 3 (t)e-^|3) (6.1) 

we define density matrix elements as pij = CiC*j. The diagonal elements pu, P22, P33 ar e 
referred to as the 'populations' while the off-diagonal elements pzx, P32, P21 are the 'coher- 
ences'. It is also useful to introduce the notation &ij = 

The laser-atom interaction is described by the potential V = er.E_cosujt, where to is 



I3> 




Figure 6: Atomic energy levels in a A-system. 
The laser frequencies loa and lob are detuned 
from the atomic energy splittings W31 and LO32 
by Ai and A2 respectively. The differential de- 
tuning is S = A 2 — Ai. 
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the laser frequency, with matrix elements 



Vij = e/Aij \E\ cos ujt = Qij cos ut (6.2) 

which defines the Rabi frequencies Qij in terms of the dipole matrix elements p,^. The 
polarisation of this laser-atom system is given in terms of the density matrix elements by 

P = Ar(/x 13 P3ie- iW3lt + fi 23 P32e- luJ32t + c.c. ) (6.3) 

where N is the atomic number density, since the diagonal elements do not contribute 
by the dipole selection rule. Abbreviating ^31 = Q A and ^32 = Q B for convenience, and 
anticipating the solution of the Schrodinger equation for the time dependence of the density 
matrix to set 

P3i = n A e iA ^p 31 p 32 = ^ B e iA ' 2t p 3 2 (6.4) 

we have 

P = N(fi 13 n A e- WAt p 31 + ix^BeT WBt hi + c.c. ) (6.5) 

Recalling that in general P = x e o\E\ cos cot, we find the dielectric susceptibilities for a 
probe laser A in the coupled laser B-atom system, and vice-versa, are 

XA = |Ml3| P31 XB = \P23\ P32 (6.6) 

This is related to the refractive index by n(u) = l + x/2, so we eventually find the following 
essential relations between the refractive indices for lasers A and B and the reduced density 
matrix elements pif 

ua{ua) = 1 + — |^i3| 2 P3i n B (uj B ) = 1 + — |/x 23 | 2 p 3 2 (6.7) 

The next step is to calculate the density matrix elements. They satisfy the Schrodinger 
equation 

where the laser-atom interaction (6.2) is (implementing the standard 'rotating-wave ap- 
proximation' [44]) 

Hi = -i0W Al '<731+fW A2t <T32 + C.C.) (6.9) 

The important equations are those for the coherences. With the initial condition p\\ = 
1) Pi2 = P33 = corresponding to a population initially in state |1), we find 

—j^-- - -ll A e + -U B e p 2 i - -731 P3i 
d P32 i Q iAlt * 1 n 

— - ~n B e p 21 - -732P32 

^ = - ^Ae^ph + l -n B e-^p 31 - \ 121P21 (6.10) 
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where we have included the total widths 7jj describing the spontaneous radiative decay 
of state 1 3) and dephasing [38]. It is straightforward to see that the time-dependence 
implied by these equations is consistent with eq.(6.4). For convenience, we also set p2i = 
exp(i5t) p2i , where S = Ai — A2. Then, with notation Ai = Ai — ^731 etc., we obtain the 
following set of algebraic equations for the reduced matrix elements piy. 

2AiP3i = 1 + ^P21 
2A 2 p 3 2 = ^~P21 

25 P21 = - Qa^*bP32 + Q*B^AP31 



(6.11) 



Solving these by elimination of P21, we find the master formulae relevant for a general 
A-system: 
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(6.12) 



At this point, we specialise to the two cases of interest - EIT and slow light, and Raman 
gain lines. 

6.1 EIT and slow light 

The standard EIT scenario is where B is the coupling laser while A is a weak probe. In 
this case, we are interested in the refractive index ua(uja) for propagation of the probe 
laser light through the coupled laser B-atom system. This is given by the reduced density 
matrix element p^\. Since the probe laser is weak, we assume \Q.a\ 2 / ^5 <C 1 and drop 
the corresponding term in eq.(6.12). We also assume that 721 ~ 0. This gives 



P31 



1 

2 A 



45 



"§731 



(6.13) 



In terms of the refractive index, this is: 
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The dependence on the probe frequency uja = W31 — Ai, for fixed wg, is sketched in 
Fig. 7. The main feature in Im ua(uja) is a double absorption line with an intermediate 
region around lua — wb + u;2i (5 = 0) where Im ha(uja) — 0. At 5 = 0, the imaginary 
part vanishes so there is no absorption. This is electromagnetically-induced transparency. 



- 34 - 



Re n 



Im n 




Figure 7: The real (left figure) and imaginary (right figure) parts of the refractive index in a 
A-system exhibiting electromagnetically-induced transparency (E1T). 

However, although there is near complete transparency in this frequency region, there 
is significant dispersion - the refractive index Re ua(u>a) rises steeply (normal dispersion) 
and nearly linearly. Recalling that the group velocity is given by 

/ din \ ~~ i 

v gp (uj) = \ aJrUJ ~^ J ) (6.15) 

a large positive slope dn/dco corresponds to a small group velocity. Linearity implies 
negligible pulse distortion. This is the phenomenon of 'slow light', which has allowed 
group velocities as low as 17 ins -1 to be achieved in ultracold Na atom cells [39]. 

6.2 Raman gain lines and gain-assisted anomalous dispersion 

To describe the Raman gain scenario, we reverse the roles of the lasers so that now A 
corresponds to the coupling field and B is the probe. This time, therefore, we are interested 
in the refractive index ub(ujb) given by the reduced coherence P32 in eq.(6.12). In the 
relevant experiment [14, 42], we are concerned with the two-photon Raman transition 
|1) — » 1 2) via the intermediate state ~ |3), so laser A is interpreted as the Raman pump 
and B the Raman probe field. 

The relevant approximation here is that (while \£Ia\ 3> \^b\) both fields are weak in 
the sense |f2 J 4| 2 /A25 <C 1 and |J1b| 2 /Ai<5 <C 1, while for such Raman experiments we can 
neglect the width of the state |3) so that 731 ~ 732 ~ 0. We also assume the common 
detuning A = (Ai + A2V2 is much greater than the differential detuning 5 = A\ — A2. 

With these simplifications, eq.(6.12) reduces to 

P32 = ^xr rr-r ( 6 - 16 ) 



8A 2 (5 + 1721 



The corresponding refractive index is 



Re n B {uJB) = 1 + ^—|^23 



Seo'^' A 2 <5 2 + i 7 2i 
16e A 2 S 2 + 
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The refractive index is sketched as a function of = 1^32 — A2, for fixed u>a, in Fig- 8. 




w 



Figure 8: The real (left) and imaginary (right) parts of the refractive index for the single Raman 
gain line given by eq.(6.17). This illustrates how it is possible to have u p h(oo) < w p h(0) in a system 
with Im n(u>) < 0. 

Here, the physics is quite different. This time we have Im tib(wb) < 0) indicating gain 
rather than absorption. The single Raman gain line is centred onwg ~ uja — W21 (5 = 0). 
For low frequencies, the refractive index Re ub(^b — 0) is less than 1, corresponding to 
a super luminal phase velocity ^(wb ~ 0) > 1. For large ujb, the phase velocity tends 
to 1 as usual, f p h(oo) = 1. This behaviour of the refractive index and phase velocity is 
therefore compatible with the KK dispersion relation with Im ub(^b) < 0. 

The Raman gain line therefore exhibits precisely the behaviour we speculate is neces- 
sary to resolve the potential paradox associated with the superluminal phase velocity in 
QED in a gravitational field. The essential feature is that the coupled laser-atom A-system 
in this case exhibits gain rather than absorption, i.e. Im n(u) < rather than the more 
familiar Im n(u) > 0. Unlike the situation with absorption, where photons are scattered 
out of the probe beam, in the case of gain the intensity of the probe beam is increased as it 
passes through the laser-atom medium. This amplification of the electric field of the probe 
laser is due to the additional photons in the probe beam coming from the second stage of 
the Raman transition. It is maximised when the differential detuning 6 ~ 0. 

The crucial point for our considerations is that it is possible to have t> p h(oo) < t> p h(0) 
provided Im n(uS) < 0, which means that the light must gain in intensity due to its passage 
through the medium. In the next section, we discuss whether this physical mechanism 
could be realised in the very different scenario of light propagation in gravitational fields. 

Before leaving this section, however, we briefly mention an intriguing experimental 
application of the Raman gain scenario. This involves a system with two Raman pump 
fields with slightly different frequencies. This produces a gain doublet, with refractive index 
as sketched in Fig. 9. 

The region between the two gain lines has Im n(u) ~ 0, so there is virtually no gain, 
or absorption, and once again the system is transparent to the probe light. The refractive 
index in this region is steeply falling, and can be made approximately linear as shown - we 
therefore have a frequency range exhibiting transparency with linear anomalous dispersion. 
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Figure 9: The real (left) and imaginary (right) parts of the refractive index for the Raman gain 
doublet, exhibiting the phenomenon of gain- assisted anomalous dispersion and negative group ve- 
locity. 

From eq.(6.15), we see that the group velocity in this region may be made extremely large 
and even, if the slope dn/du) is sufficiently large, we may find v gp negative. 

Remarkably, the phenomenon of a negative group velocity with negligible pulse distor- 
tion has been realised experimentally [14, 42, 43]. The set-up for these experiments uses 
a A-system where the energy levels shown in Fig. 6 are hyperfine levels of Cs atoms with 
|1) = 6S 1/2 \F = 4,m = -4), |2) = 6S 1/2 \F = 4,m = -2) and |3) = 6P 3/2 |F = 4,m = -3) 
with the probe laser close to resonance with the Cs D 2 line. A laser pulse was passed 
through the Cs vapour cell with a group velocity of —0.003c, so that it exits the cell before 
entering it. For more details of this intriguing experiment, see refs.[14, 42, 43]. 

7. Superluminality and UV completion 

In this paper, we have analysed the propagation of light and the dispersion relation for the 
refractive index in three distinct quantum systems. For QED in a background electromag- 
netic field, we have used a recent evaluation of the one-loop vacuum polarisation to plot 
both the real and imaginary parts of n(cj) over the full frequency range. Expressing the vac- 
uum polarisation in the Newman- Penrose formalism allowed an elegant identification of the 
role of the null energy projection T^ u k^k u , clarifying its relation to the possible occurrence 
of superluminal propagation. For QED in a gravitational field, only the low-frequency range 
of the refractive index is currently well understood. We reviewed and extended some of 
our earlier work in this field, demonstrating the occurrence of a superluminal low- frequency 
phase velocity, v p h(0) > 1, for certain spacetimes. This was illustrated with examples of 
both Ricci-flat and Weyl-flat spacetimes, the former exhibiting gravitational birefringence 
as well as superluminality. Finally, we gave a unified treatment of propagation in cou- 
pled laser-atom A-systems exhibiting either electromagnetically-induced transparency with 
'slow light' or Raman gain lines, where gain-assisted anomalous dispersion permitted the 
remarkable and experimentally observed phenomenon of a negative group velocity. 

The purpose of drawing these various examples together here is to shed light on the 
proposal that the fundamental axioms of local quantum field theory or string theory imply 



-37- 



constraints on the couplings of the corresponding IR effective field theory. That is, does 
the requirement that an effective field theory should have a consistent UV completion 
necessarily imply positivity constraints on the couplings of the leading irrelevant operators, 
with implications for the phenomenology of the low-energy theory? 

In the case of QED in flat spacetime, this proposal works beautifully. As we explained, 
causality itself only requires the UV limit of the phase velocity to be subluminal, v p h(oo) < 
1. But provided the imaginary part of the refractive index is positive, which we confirmed 
by an explicit evaluation showing Im n{uo) has a single absorption line in a background 
electromagnetic field, the KK dispersion relation ensures f p h(0) < t> p h(oo). So causality 
does indeed require f p h(0) < 1, which implies positivity constraints on the leading irrelevant 
operators of the low-energy effective theory for QED, viz. the Euler-Heisenberg effective 
action. Of course, explicit perturbative calculations confirm that these are satisfied. 

QED in curved spacetime, on the other hand, presents a serious problem. Here we 
find examples where the low-energy effective theory has a superluminal phase velocity, 
v ph(0) > 1. At first sight, this appears to be incompatible with causality. We identified 
three ways in which causality could be maintained: first, that an extension of the notion 
of 'stable causality' may permit v p h(oo) > I without causal violations; second, that the 
KK dispersion relation is not valid in its standard form for QFT in curved spacetime; and 
third, that at least in gravitational theories, Im n{uj) (and by extension Im A4(s, 0)) may be 
negative. To explore the plausibility of this last option, we showed how a negative Im n(uj) 
arises in Raman A-systems, where it is associated with gain rather than absorption. We 
have not found a similar example of Im n(w) < in a non- gravitational quantum field 
theory and speculate that its occurrence for QED in curved spacetime may be related to 
the ability of gravitational interactions to focus geodesies and amplify light waves. 

We do not yet know with certainty which of these is the physically realised option. 
However, each of them would invalidate the use of u p h(0) < 1 (or related analyticity con- 
straints on scattering amplitudes M(s,t)) as a criterion constraining the couplings of the 
IR effective field theory, since we must allow the possibility that the UV completion may 
involve gravity. This would appear to undermine the use of the positivity constraints as 
restrictions on low-energy phenomenology, though with better understanding they may 
still provide important information on the nature of the UV theory. In any case, under- 
standing how to accommodate a superluminal low-frequency phase velocity with causality 
in quantum field theory is now an urgent problem whose resolution will surely shed light 
on the quantum theory of gravity. 
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A. Appendix: Electrodynamics in Newman-Penrose formalism 



We collect here some formulae involving the Newman-Penrose formalism which are used 
in the main text. A useful review is contained in ref.[31]. The essential feature is the 
introduction of a null tetrad with vierbeins e A where the index l A' refers to four null 
vectors C^,n^,m^,m^. These are chosen so that their only non-vanishing scalar products 
are £.n = 1 and m.fh = — 1. In this basis, the metric is therefore 



VAB = eA e B9lxv 



/0 1 

1 



\0 -1 



\ 


-1 

/ 



(A.l) 



It follows that 

= (£»n v ) - {m^m u ) (A.2) 

with notation (ab) = ab + ba. We also use [ab] = ab — ba. It follows that the trace in the 
null basis is trM A B = r) BA M A B = M {£n) - M {mfh) . 

To describe the electromagnetic field strengths, we introduce three complex scalars 
00, 0i, 02 defined as 



0i = \F uv {E< t n v + mi t m v ) 



b 2 = F ilv m^n v 



(A.3) 



Inverting this, we recover the formula for F^ u quoted in eq.(2.20), viz. 



F^ = - (0i + <t>\Wn v ] + (0i - 0* ) Km"] + 02 [#W] + <j>* 2 [tm v ] - (j)* [n V] - O [n^m u ] 

(A.4) 

In terms of the vierbein basis, this is 
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(A.5) 



It is useful to write explicit expressions for the 
E and B in flat spacetime. Defining 



in terms of the usual field strengths 



/ F\ E2 E3 ^ 

—Ei -B3 — B2 

—E2 — -B3 B\ 

\-E 3 B 2 -B ± / 



(A.6) 



15 In flat spacetime, a standard representation of these vectors would be £ M = -^(1,0,0,1), n M = 
-^=(1,0,0,-1), m M = ^75(0,1, 2,0), m M = ^=(0, 1, -i,0). This is the origin of the factor V% when we 
express the wave- vector as fc M = \/2lo£^, identifying u> as the frequency (see section 2). We use a metric 
g^v with signature (+,—,—,—). 
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we find 



E x = Re (0 O - <h) 
E 2 = Im (0o + 02) 
E 3 = - 2Re 0i 



B 1 = - Im (0o - 2 ) 
B 2 = Re (0o + 2 ) 
B 3 = 21m 0i 



(A.7) 
(A.8) 



and the inverse relations 



= \({E 1 + B 2 )+i{E 2 -B 1 )) 



-(E 3 -iB 3 ) 



B 2 ) + i(E 2 + B 1 )^j 



(A.9) 



The corresponding results for the dual field strength tensor F uv = \e nu \ p F Xp are 

= -i(0i -0*) [tn v \ +*'(0i +01 ) [m^m v ] +i(f> 2 [tm^-ic^ [tm v ] +i05 [n V] -ifa [n^m"} 

(A.10) 

that is, 
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We also need expressions for the NP components of the squares of the field strengths 
(F 2 ) uu , (FF) nu and (FF) nu . In matrix notation, we find 

/ 20005 (F 2 )in 20001 20105 \ 

(F 2 ) n e 20 2 0* 2 20i0^ 20201 

20001 20i0* 2 20o02~ {F 2 ) mif , 

V 20105 2020* {F 2 ) mrn 20205 / 
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(A.14) 
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where here (FF)i n = (FF) 



nt 



The Lorentz invariants are 

f = \f^f^ 



g = \f^f^ 



(0! - ft Y + O 2 + 0o>2 and (FF) r 



{FF), 



2 B 2x 



2 Re (0o02 - 0?) 
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and so = \ [T + i£7) 



— 0f . We also note the useful identity 
(FF)vv ~ (FF)^ = 1T 9ilv 



(A.15) 



(A.16) 



The energy-momentum tensor is defined in pure Maxwell electrodynamics as 

T/xv = ~~ F^\F U + —g^uF 

In terms of the NP field strength scalars, this can be written as 

Tpv = 2^0101^^) + 0i0i (m^rhv) - 2 0K^ m ^) ~ <\>2<\>\{£y.™v) ~ 0o<M r V m ^) 
-0o0i [n^rhy) + 0202 + 0o0o n /i n ^ + 020o"V m ^ + 0200%^ 



(A.17) 



(A.18) 



or rather more concisely, 



Tab 



/0O0S 0101 0001 0100 \ 

101 0202 0102 0201 

102 0002 0101 

201 0101 020Q / 



Po9l 

\0105 



(A.19) 



This factorisation property is the origin of many of the simplifications which follow from 
the NP formalism for electrodynamics. 
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